(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 




(43) International Publication Date (•«) International Publication Number 

28 June 2001 (28.06.2001) PCT WO 01/46407 Al 



(51) International Patent Classification 7 : 
(21) International Application Number: 



C12N 15/11 



PCT/USOQ/33738 



(22) International Filing Date: 

1 2 December 2000 ( 1 2. 12.2000) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

60/171,566 



23 December 1999 (23.12.1999) US 



(71) Applicant: LEXICON GENETICS INCORPORATED 

[US/US]; 4000 Research Forest Drive, The Woodlands, TX 
77381 (US). 

(72) Inventors: WALKE, D., Wade; 7507 Danehill Drive, 
Spring, TX 77389 (US). TURNER, C, Alexander, Jr.; 
67 Winter Wheat Place, The Woodlands, TX 77381 (US). 
ABUIN, Alejandro; 19 Belcarra Place, The Woodlands, 
TX 77382 (US). FR1EDR1CH, Glenn; Breland & Bre- 
land, 2207 Hermann Drive, Houston, TX 77004 (US). 
ZAMBROWICZ, Brian; 1 8 Firethome Place, The Wood- 
lands, TX 77382 (US). SANDS, Arthur, T.; 163 Bristol 
Bend Circle, The Woodlands, TX 77382 (US). 



(74) Agents: ISHIMOTO, Lance, K. et al.; Lexicon Genetics 
Incorporated, 4000 Research Forest Drive, The Woodlands, 
TX 77381 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, CZ, 
DE. DK, PM DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, 
HU, ID, IL, IN, IS, JP, KE, KG, KP. KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, TZ, UA, UG, UZ, VN, VU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS. MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY. KG, KZ, MD, RU, TJ, TM). European 
patent (AT. BE, CH. CY, DE, DK, ES, FI, FR, GB, GR, IE. 
IT, LU, MC, NL, PT, SE, TR). OAPI patent (BF, BJ, CF, 
CG, CI, CM. GA, GN, GW, ML, MR, NE. SN, TD, TG). 

Published: 

— With international search report. 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes ami Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette: 



(54) Title: POLYNUCLEOTIDES ENCODING HUMAN PROTEASE HOMOLOGS 



^ (57) Abstract: Novel human polynucleotide and polypeptide sequences are disclosed that can be used in therapeutic, diagnostic, 



and pharmacogenomic applications. 



WO 01/46407 



PCT/US00/33738 



POLYNUCLEOTIDES ENCODING HUMAN PROTEASE HOMOLOGS 

The present application claims the benefit of U.S. 
5 ■ Provisional Application Number 60/171,566 which was filed on 
December 22, 1999 and is herein incorporated by reference in 
its. entirety. 

1 . INTRODUCTION 
The present invention relates to the discovery, 

10 identification, and characterization of novel human 

polynucleotides encoding proteins sharing sequence similarity 
with mammalian proteases. The invention encompasses the 
described polynucleotides, host cell expression systems, the 
encoded protein, fusion proteins, polypeptides and peptides, 

15 antibodies to the encoded proteins and peptides, and 

genetically engineered animals that either lack or over 
express the disclosed sequences, antagonists and agonists of 
the proteins, and other compounds that modulate the expression 
- or activity of the proteins encoded by the disclosed 

20 polynucleotides that can be used for diagnosis, drug 

screening, clinical trial monitoring and the treatment of 
physiological disorders. 

2. BACKGROUND OF THE INVENTION 
25 Proteases cleave protein substrates as part of 

degradation, maturation, and secretory pathways within the 
body. Proteases have been associated with,, -inter alia, 
regulating development, modulating cellular .processes, 
fertility, and infectious disease. 

30 

3. SUMMARY OF THE INVENTION 
The present invention relates to the discovery, 
. identification, and characterization of nucleotides that . 
encode novel human proteins, and the corresponding amino acid 
3 5 sequences of these proteins. The novel human proteins (NHPs) 
described for the first. time herein share structural . 
similarity with animal proteases, and particularly trypsin- 
like proteases such as , oviductin . 



WO 01/46407 PCT/USOO/33738 

The novel human nucleic acid (cDNA) sequences described, 
herein, encode a proteins /open reading frames (ORFs) of 306, 
3 02, and 164 amino acids in length (see SEQ ID NOS : 2, 4, and 
6 respectively) . 

5 The invention also encompasses agonists and antagonists 

of the described NHPs, including small molecules, large 
molecules, mutant NHPs, or portions thereof that compete with 
native NHPs, NHP peptides, and NHP antibodies, as well as 
nucleotide sequences that can be used to inhibit the 

10 expression of the described NHPs (e.g., antisense and ribozyme 
molecules, and gene or regulatory sequence replacement 
constructs) or to enhance the expression of the described NHPs 
(e.g., expression constructs that place the described sequence 
under the control of a strong promoter system) , and transgenic 

15 animals that express a NHP transgene, or "knock-outs" (which 
can be conditional) that do not express a functional NHP. 

Further, the present invention also relates to processes 
for identifying compounds that modulate, i.e.,' act as agonists 
or antagonists, of NHP expression and/or NHP activity that 

20 utilize purified preparations of the described NHP and/or NHP 
product, or cells expressing the same. Such compounds can be 
used as therapeutic agents for the treatment of any of a wide 
variety of symptoms associated with biological disorders or 
imbalances . 

25 

4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
The Sequence Listing provides the sequences of the NHP 
ORFs encoding the described NHP amino acid sequences. SEQ ID 
NO: 7 describes an NHP ORF with flanking sequences. 

30 

5. DETAILED DESCRIPTION OF THE "INVENTION 
The NHPs, described for the first' time herein, are novel 
proteins that are expressed in, inter alia, human cell. lines, 
and human thymus, trachea, kidney, prostate, testis, thyroid, 
35 salivary gland, stomach, placenta, mammary gland, adipose, 

skin, esophagus, bladder, pericardium, and fetal kidney cells. 
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The described sequences were compiled from gene trapped 
cDNAs and clones isolated from a human kidney cDNA library 
(Edge Biosystems, Gaithersburg , MD) . The present invention 
encompasses the nucleotides presented in the Sequence Listing, 
5 host cells expressing such nucleotides, the expression 

products of such nucleotides, and: (a) nucleotides that encode 
mammalian homologs of the described sequences, including the 
specif ically -described NHPs, and the NHP products; 
(b) nucleotides that encode one. or more portions of a NHP that 

10 correspond to functional domains of the NHP, and the 

polypeptide products specified by such nucleotide sequences, 
including but not limited to the novel regions of any active 
domain(s); (c) isolated nucleotides that encode mutant 
versions, engineered or naturally occurring, of a described 

15 NHP in which all or a part of at least one domain is deleted 
or altered, and the polypeptide products specified by such 
nucleotide sequences, including but not limited to soluble 
proteins and peptides in which all or a portion of the signal 
sequence is deleted; (d) nucleotides that encode chimeric 

20 fusion proteins containing all or a portion of a coding region 
of a NHP, or one of its domains (e.g., a receptor or ligand 
binding domain, accessory protein/ self -association domain, 
etc.) fused to another peptide or polypeptide; or (e) 
therapeutic or diagnostic derivatives of the described 

25 polynucleotides such as oligonucleotides, antisense 

polynucleotides, ribozymes, dsRNA, or gene therapy constructs 
comprising a sequence first disclosed in the Sequence Listing. 

As discussed above, the present invention includes: 
(a) the human DNA sequences presented in the Sequence Listing 

30 (and vectors comprising the same) and additionally 

contemplates any nucleotide sequence encoding a contiguous NHP 
open reading frame (ORF) , or a contiguous exon splice junction 
first described in the Sequence Listing, that hybridizes to a 
complement of a DNA sequence presented in the Sequence Listing 

35 under highly stringent conditions, e.g., hybridization to 
filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate 
(SDS) , 1 mM EDTA at 65 °C, and washing in 0.1xSSC/0.1% SDS at 
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68°C (Ausubel F.M.. et al . , eds . , 1989, Current Protocols in 
Molecular Biology, Vol. I, Green Publishing Associates, Inc., 
and John Wiley & sons, Inc., New York, at p. 2.1G.3) and 
encodes a functionally equivalent gene product. Additionally 
5 contemplated are any nucleotide sequences that hybridize to 
. the complement of the DNA- sequence that encode and express an 
amino acid sequence presented in the Sequence Listing under 
moderately stringent conditions, e.g., washing in 0.2xSSC/0.1% 
SDS at 42°C (Ausubel et al., 1989, supra), yet still encode a 

10 functionally equivalent NHP product. Functional equivalents 
of a NHP include naturally occurring NHPs present in other 
species and mutant NHPs whether naturally occurring or 
engineered (by site directed mutagenesis, gene shuffling, 
directed evolution as described in, for example, U.S. Patent 

15 No. 5,837,458). The invention also includes degenerate 
nucleic acid variants of the disclosed NHP polynucleotide 
sequences. 

Additionally contemplated are polynucleotides encoding a 
NHP ORF, or its functional equivalent, encoded by a 

20 polynucleotide sequence that is about 99, 95, 90, or about 85 
percent similar or identical to corresponding regions of the' 
nucleotide sequences of the Sequence Listing (as measured by 
BLAST sequence comparison analysis using, for example, the GCG 
sequence analysis package using standard default settings) . 

.25 The invention also includes nucleic acid molecules, 

preferably DNA molecules, that hybridize to, and are therefore 
.the complements of (l the described NHP nucleotide sequences. 
Such hybridization conditions may be highly stringent or less 
highly stringent, as -described above. In instances where the 

30 nucleic acid molecules are deoxyoligonucleotides ("DNA 

oligos"), such molecules are generally about 16 to about 100 
bases long, or about 20 to about 80, or about 34 to about 45 
bases long, or any variation or combination of sizes 
represented therein that incorporate a contiguous region of 

35 sequence first disclosed in the Sequence Listing. Such 
oligonucleotides can be used in conjunction with the 
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polymerase chain reaction (PCR) to . screen libraries, isolate 
clones, and prepare cloning and sequencing templates, etc. 

Alternatively, such NHP oligonucleotides can be used as 
hybridization probes for screening libraries, and assessing 
5 gene expression patterns (particularly using a micro, array or 
high- throughput "chip" format) ... Additionally, 1 a series of the 
described NHP oligonucleotide sequences, or the complements 
thereof, can be used to represent all or a portion of the 
described NHP sequences. An oligonucleotide or polynucleotide 

10 sequence first disclosed in at least a portion of one or more 
of the sequences of SEQ ID NOS : .1-7 can be used as a 
• hybridization probe in conjunction with a solid support 
matrix/ substrate (resins, beads, membranes, plastics, 
polymers, metal or metallized substrates, crystalline or 

15 polycrystalline substrates, etc.). Of particular note are 
spatially addressable arrays ( i . e. , gene chips , microtiter 
plates, etc..) of oligonucleotides and. polynucleotides , or 
corresponding oligopeptides and polypeptides, wherein at least 
one of the biopolymers present on the spatially addressable 

20 array comprises an oligonucleotide or polynucleotide sequence 
first disclosed in at least one of the sequences of SEQ ID 
NOS: 1-7, ..or an amino acid sequence encoded thereby. Methods 
for attaching biopolymers to, or synthesizing biopolymers on, 
solid support matrices, and conducting binding studies thereon 

25 are disclosed in, inter alia, U.S. Patent Nos. 5,700,637, 
5,556,752, 5,744,305, 4,631,211, 5,445,934, 5,252,743, 
4, 713, 326, 5, 424, 186, and 4,689,405 the disclosures of which 
are herein incorporated by reference in their entirety. 

Addressable arrays comprising sequences first disclosed 

30 -in SEQ ID NOS: 1-7 can be used to identify and characterize 
the temporal and tissue specific expression of a sequence. 
. These addressable arrays incorporate oligonucleotide sequences 
of sufficient length to confer the required specificity, yet 
be within the limitations of the production technology. The 

35 length of these probes is within a range of between about 8 to 
about 2000 nucleotides. Preferably the probes consist of 60 : 
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nucleotides and more preferably 25 nucleotides from the 
.sequences first disclosed in SEQ ID NOS:l-7. 

. For example, a series of the described oligonucleotide 
sequences, or the complements thereof, can be used in chip 
5 format to represent all' or a portion of the described 

sequences. The oligonucleotides, typically between about 16 
..to about 40 (or any whole number within the stated range) 
nucleotides in length can partially overlap each other and/or 
the sequence may be represented using oligonucleotides that do 

10 ' not overlap. Accordingly, the described polynucleotide 

sequences shall typically comprise at least about two or three 
distinct oligonucleotide sequences of at least about 8 .. 
nucleotides in length that, are each first disclosed in the 
described Sequence Listing. Such oligonucleotide sequences 

15 can begin at any nucleotide present within a sequence in the ■ 
Sequence. Listing and proceed in either a sense (5'-to-3')- 
orientation vis-a-vis the described sequence or in an 
antisense orientation. v 

Microarray-based analysis allows the discovery of broad 

20 patterns of genetic activity,- providing new understanding of 
gene functions and generating novel and unexpected insight, 
into transcriptional processes and biological mechanisms. The 
use of addressable arrays comprising sequences first disclosed 
• in SEQ ID NOS: 1-7 provides detailed information about 

25 transcriptional changes involved in a specific pathway, 

potentially leading to' the identification of novel components 
or gene functions that manifest themselves as novel 
pheno types . ' . 

Probes consisting of sequences first disclosed in SEQ ID 

30 N0S:l-7 can also be used in the identification, selection and 
validation of novel molecular targets for drug discovery. The 
use of these unique sequences permits the direct confirmation 
of drug targets and recognition of drug dependent changes in 
gene expression that are modulated through pathways distinct 

35 from the drugs intended target . These unique sequences ... 
therefore also have utility in defining and monitoring both 
drug action and toxicity. \ . ]•■ ' 
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As an example of utility, the sequences firsc disclosed 
in SEQ ID NOS : 1-7 can be utilized in microarrays or other 
assay formats, to screen collections of genetic material from 
patients who have a particular medical condition. These 
5 investigations can also be carried out using the sequences 
first disclosed in SEQ ID NOS : 1-7 in silico and by comparing 
previously collected genetic databases and the disclosed 
sequences using computer software known to those in the art. 

Thus the sequences first disclosed in SEQ ID NOS: 1-7 can 

10 be used to identify mutations associated with a particular 
disease and also as a diagnostic or prognostic assay. 

Although the presently described sequences have been 
specifically described using nucleotide sequence, it should be 
appreciated that each of the sequences can uniquely be 

15 described using any of a wide variety of additional structural 
attributes, or combinations thereof. For example, a given 
sequence can be described by the net composition of the 
nucleotides present within a given region of the sequence in 
conjunction with the presence of one or more specific 

20 oligonucleotide sequence (s) first disclosed in the SEQ ID NOS: 
1-7. Alternatively, a restriction map specifying the relative 
positions of restriction endonuclease digestion sites, or 
various palindromic or other specific oligonucleotide 
sequences can be used to structurally describe a given 

25 sequence. Such restriction maps, which are typically 

generated by widely available computer programs (e.g., the 
University of Wisconsin GCG sequence analysis package, 
SEQUENCHER 3.0, Gene Codes Corp., Ann Arbor, MI, etc.), can 
optionally be used in conjunction with one or more discrete 

30 nucleotide sequence (s) present in the sequence that can be 

described by the relative position of the sequence relatve to 
one or more additional sequence (s) or one or more restriction 
sites present in the disclosed sequence. 

For oligonucleotide probes, highly stringent conditions 

35 may refer, e.g., to washing in 6xSSC/0.05% sodium . 

pyrophosphate at 37°C (for 14-base oligos) ; 48°C (for 17-base 
oligos) , 55°C (for 20-base oligos), and 60°C (for 23-base 
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oligos) . These nucleic acid molecules may encode or acn as 
NHP sequence antisense molecules, useful, for example, in NHP 
gene regulation (for and/or as ancisense primers in 
amplification reactions of NHP gene nucleic acid sequences) . 
5 With respect to. NHP gene regulation, such techniques can be 
used to regulate biological functions. Further, such 
sequences may be used as part of ribozyme and/or triple helix 
sequences that are also useful for NHP gene regulation. 

Inhibitory antisense or double stranded oligonucleotides 
10 can additionally comprise at least one modified base moiety 
which is selected from the group including but not limited to 
5-f luorouracil, 5-bromouracil, 5-chlorouracil , 5-iodouracil , 
hypoxanthine, xantine, 4-acetylcytosine, 

5- (carboxyhydroxylmethyl) uracil, 5 r carboxymethylaminomethyl- 

15 2-thiouridine, 5-carboxymethylaminomethyluracil , 
dihydrouracil, beta-D-galactosylqueosine, inosihe, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 
2, 2-dimethylguanine, 2-methyladenine, 2-methylguanine , 
3-methylcytosine, 5 -me thy Icy to sine, N6-adenine, 

20 7-methylguanine, 5-methylaminomethyluracil, 

5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 
5'-methoxycarboxymethyluracil, 5-methoxyuracil , 
2 -methyl thio-N6-isopentenyladenine, uracil- 5-oxyacetic acid 
(v) , wybutoxosine, pseudouracil , queosine, 2-thiocytosine, 

25 5-methyl-2-thiouracil, 2-thiouracil , 4-thiouracil , 

5-methyluracil, uracil-5-oxyacet'ic acid methylester /. uracil- 
5-oxyacetic acid (v) , 5-methyl-2-thiouracil , 3- (3-amino-3-N-2- 
carboxypropyl ) uracil, (acp3)w, and 2 , 6-diaminopurine . 

The antisense oligonucleotide can also comprise at least 

30 one modified sugar moiety selected from the group including 

but not limited to arabinose, 2-f luoroarabinose, xylulose, and 
hexose . . . 

In yet another embodiment, the antisense oligonucleotide 
will comprise- at least one modified phosphate . backbone : • 

35 selected from the group consisting of a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, \ a phosphoramidate , 
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a phosphordiamidate, a methylphosphonate, an alkyl 
phosphotriester , and. a formacetal or analog thereof. 

In yet another embodiment, the antisense oligonucleotide 
is an a-anomeric oligonucleotide. An a-anomeric 
5 oligonucleotide forms specific double- stranded hybrids with 
complementary RNA in which, contrary to the usual 3-units, the 
strands run parallel to each other (Gautier et al . , 1987, 
Nucl. Acids Res. 25:6625-6641). The oligonucleotide is a 2'-0- 
methylribonucleotide (Inoue et al.., 1987, Nucl. Acids Res. 
10 15: 613i-6148) , or a chimeric RNA-DNA analogue (Inoue et al . , 

1987, FEBS Lett. 225:327-330). Alternatively, double stranded 
RNA can be used to disrupt the expression and function of a 
targeted NHP . 

Oligonucleotides of the invention can be synthesized by 

15 standard methods known in the art, e.g. by use of an automated 
DNA synthesizer (such as are commercially available from 
Biosearch, Applied Biosystems, etc.). As examples, 
phosphorothioate oligonucleotides can be synthesized by the 
method of Stein et al. (1988, Nucl. Acids Res. 26:3209), and 

20 methylphosphonate oligonucleotides can be prepared by use of 
controlled pore glass polymer supports (Sarin et al . , 1988, 
Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451), etc. 

Low stringency conditions are well known to those of 
skill in the art, and will vary predictably depending on the 

25 specific organisms from which the library and the labeled 

sequences are derived. For guidance regarding such conditions 
see, for example, Sambrook et al . , 1989, Molecular Cloning, A 
Laboratory Manual (and periodic updates thereof), Cold Springs 
Harbor Press, N.Y. ; and Ausubel et al., 1989, Current 

30 Protocols in Molecular Biology, Green Publishing Associates 
and Wiley Interscience, N.Y. 

Alternatively, suitably labeled NHP nucleotide probes can 
be used to screen a human genomic library using appropriately 
'"stringent conditions or by PCR. The identification and 

35 characterization of human genomic clones is helpful for 
identifying polymorphisms (including, but not limited to, 
nucleotide repeats, microsatellite alleles, single nucleotide 
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polymorphisms, or coding single nucleotide polymorphisms), 
determining the genomic structure of a given locus/allele, and 
•designing diagnostic tests . For example, sequences derived 
from regions adjacent to the intron/exon boundaries of the 
5 human gene can be used to design primers for use in 

amplification assays to detect mutations within the exons, 
introns, splice sites (e.g., splice acceptor and/ or donor 
sites), etc., that can be used in diagnostics and 
pharmacogenomics . 

10 Further, a NHP homolog can be isolated from nucleic acid 

from an organism of interest by performing PCR using two 
degenerate or "wobble" oligonucleotide primer pools designed 
on the basis of amino acid sequences within the NHP products 
disclosed herein. The template for the reaction may be total 

15 RNA, mRNA, and/or cDNA obtained by reverse transcription of 
mRNA prepared from human or non-human cell lines or tissue 
known or suspected to express an allele of a NHP gene. 

The PCR product can be subcloned and sequenced to ensure 
that the amplified sequences represent the sequence of the 

20 desired NHP. The PCR fragment can then be us§d$.£o -isolate a 
full length cDNA clone by a variety of methods. For example, 
the amplified fragment . can be labeled and used to screen a 
cDNA library, such as a bacteriophage cDNA library. 
Alternatively, the labeled fragment can be used to isolate 

25 genomic clones via the screening of a genomic library. 

PCR technology can also be used to isolate' full length 
cDNA sequences. For example, RNA can be isolated, following 
standard procedures, from an appropriate cellular or. tissue 
source (i.e., one known, or suspected, to express a NHP gene, 

30 such as, for example, testis tissue) . A reverse transcription 
(RT) reaction can be performed on the RNA using an 
oligonucleotide primer specific for the most 5' end of the 
amplified fragment for the priming of first strand synthesis. 
The resulting RNA/DNA hybrid may then be "tailed" using a 

35 standard terminal transferase reaction, the hybrid may be 

digested with RNase H, and second strand synthesis may then be 
primed with a complementary primer. Thus, cDNA sequences 
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upstream of the amplified fragment can be isolated. For a 
review of cloning strategies that can be used, see e.g., 
Sambrook et al . , 1989, supra. 

A cDNA encoding a mutant NHP gene can be isolated, for . 
5 example, by using PCR. In this case, the first cDNA strand 
may be synthesized by hybridizing an oligo-dT oligonucleotide . 
to mRNA isolated from tissue known or suspected to be 
expressed in an individual putatively carrying a mutant NHP 
allele, and by extending the new strand with reverse 
10 transcriptase. The second strand of the cDNA is then 
synthesized using an oligonucleotide that hybridizes 
specifically to the 5' end of the normal gene. Using these 

• two primers,, the product is then amplified via PCR, optionally 
cloned into a suitable vector, and subjected to DNA sequence 

15 analysis through methods. well known to those of skill in the 
art. By comparing the DNA sequence of the mutant NHP allele 
to that of a corresponding normal NHP allele, the mutation (s) 
responsible for the loss or alteration of function of the 
mutant NHP gene product can be ascertained. 

20 Alternatively, a genomic library can be constructed using 

DNA obtained from an individual suspected of or known to carry 

• a mutant NHP allele (e.g., a person manifesting a NHP- 

• associated phenotype such as, for example, obesity, high blood 
pressure, connective tissue disorders, infertility, etc.), or 

25 a cDNA library can be constructed using RNA from a tissue 

known, or suspected, to express a mutant NHP allele. A normal 
NHP gene, or any suitable fragment thereof, can then be 
labeled and used as . a probe to identify the corresponding 
mutant NHP allele in such libraries. Clones containing mutant 

30 NHP gene sequences can then be purified and subjected to 
sequence analysis according to methods well known to those 
skilled in the art. 

Additionally, an expression library can be constructed 
utilizing cDNA synthesized f rom, f or example, RNA isolated 

35 from a tissue known, or suspected, to express a mutant NHP 

allele in an individual suspected of or known to carry such a 
mutant allele. In this manner, gene products made by the 

11 
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putatively mutant tissue can be expressed and screened using 
standard antibody screening techniques in conjunction with 
antibodies raised against normal NHP product, as described . 
below. (For screening techniques , see, for example, Harlow, . 
E. and Lane, eds . , 1988, "Antibodies: A Laboratory Manual", 
Cold Spring Harbor Press,' Cold Spring Harbor.) 

Additionally, screening can be accomplished by screening with' 
labeled NHP fusion. proteins , such as, for example, alkaline 
phosphatase-NHP or NHP-alkaline phosphatase fusion proteins. 
In cases where a NHP mutation results in an expressed gene 
product with altered function (e.g., as a result of a missense 
or a frameshift mutation) , polyclonal antibodies to NHP are 
likely to cross-react with a corresponding mutant NHP gene 
product. Library clones detected via their reaction with such 
labeled antibodies can be purified and subjected to sequence 
analysis according to methods well known in the art. 

The invention also encompasses (a) DNA vectors that 
contain any of the foregoing NHP coding sequences and/or their 
complements (i.e., antisense) ; (b) DNA expression vectors that 
contain any of the foregoing NHP coding sequences operatively 
.associated with a regulatory element that directs the 
expression of the coding sequences (for example, baculo virus 
as described in U.S. Patent No.' 5,869,336 herein incorporated 
by reference) ; (c) genetically engineered host cells that 
contain any. of the foregoing NHP coding sequences operatively 
associated with a regulatory element that directs the 
expression of the coding sequences in the host cell; and (d) 
genetically engineered host cells that express an endogenous 
NHP sequence under the control of an exogenously introduced 
regulatory element (i.e., gene activation). As used herein, 
regulatory elements include, but are not limited to, inducible 
and non- inducible promoters, enhancers, operators and other .; 
..elements known to those skilled in the art that drive and 
regulate expression. Such regulatory elements include but are 
not limited to the human cytomegalovirus (hCMV) immediate 
early gene, regulatable, viral elements (particularly 
retroviral LTR promoters) , the early or late promoters of SV40 
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adenovirus, the lac system, the trp system, the TAC system, 
the TRC system, the major operator and promoter regions of 
phage lambda, the control regions of fd coat protein, the 
promoter for 3-phosphoglycerate kinase (PGK) , the promoters of 
acid phosphatase, and the promoters of the yeast a-mating 
factors . 

The present invention also encompasses antibodies and- 
anti-idiotypic antibodies (including Fab fragments) , 
antagonists and agonists of a NHP, as well as compounds or 
nucleotide constructs that inhibit expression of a NHP gene 
(transcription factor inhibitors, antisense and ribozyme 
molecule's, or gene or regulatory sequence replacement 
constructs), or promote the expression of a NHP (e.g., 
expression constructs in which NHP coding sequences are 
15 operatively associated with expression control elements such 
as promoters, promoter / enhancers , etc.). 

The NHPs or NHP peptides, NHP fusion proteins, NHP 
nucleotide sequences, antibodies, antagonists and agonists can 
be useful for the detection of mutant NHPs or inappropriately 
20 expressed NHPs for the diagnosis of disease. The NHP proteins 
or peptides, NHP fusion proteins, NHP nucleotide sequences, 
host cell expression systems, antibodies, antagonists, V 
agonists and genetically engineered cells and animals can be 
used for screening for drugs (or high throughput screening of 
25 combinatorial libraries) effective in the treatment of the 
symptomatic or phenotypic manifestations of perturbing the 
normal function of a NHP in the body. The use of engineered 
host cells and/or animals may offer an advantage in that such • 
systems allow not only for the identification of compounds 
30 that bind to the endogenous receptor for a NHP., but 'can also 
identify compounds that trigger NHP-mediated activities or 
pathways . 

Finally, ..the NHP products can be used as therapeutics. 
For example, soluble derivatives such as NHP peptides /domains 
35 corresponding to NHP, NHP fusion protein products. (especially 
NHP-Ig fusion proteins, i.e., fusions of a NHP, or a domain of- 
a NHP, to an IgFc) , NHP antibodies and anti-idiotypic 
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antibodies (including Fab fragments) , antagonises or agonists 
(including compounds that modulate or act on downstream 
targets .in a NHP-mediated pathway) can be used to directly 
treat diseases or disorders. For instance, the administration 
5 of an effective amount of soluble NHP, or a NHP-IgFc fusion 
protein or an anti-idiotypic antibody (or its Fab) that mimics 
the NHP could activate or effectively antagonize the 
endogenous NHP receptor. Nucleotide constructs encoding such 
NHP products can be used to genetically engineer host cells to 

10 express such products in vivo; these genetically engineered 
cells function as "bioreactors " in the body delivering a 
continuous supply of a NHP, a NHP peptide, or a NHP fusion 
protein to the body. Nucleotide constructs encoding 
functional NHP, mutant NHPs , as well as antisense and ribozyme 

15 molecules can also be used in "gene therapy" approaches for 
the modulation of NHP expression. Thus, the invention also 
encompasses pharmaceutical formulations and methods for 
.treating biological disorders. 

Various aspects of the invention are described in greater 

20 detail in the subsections below. 

5.1 THE NHP SEQUENCES 
The cDNA sequences (SEQ ID NO: 1, 3, and 5) and the 
corresponding .deduced amino acid sequences of the described 

25 NHP are presented in the Sequence Listing. SEQ ID NO: 7 
■ describes a NHP ORF as well as flanking regions. The NHP 
nucleotides were- obtained from human cDNA libraries using 
probes and/ or primers generated from human gene trapped 
sequence tags. Expression analysis has provided evidence that 

30 the described NHP can be expressed a variety of human cells as 
well as gene trapped human cells. In addition, the described 
NHP sequences can contain a variety of polymorphisms such as 
at nucleotide 68 of SEQ ID NO:l and nucleotide 56 of SEQ ID . 
NO: 3 which both can be a G or an A that can give. rise to 

35 corresponding arg or gin at amino acid position 23 of SEQ ID 
NO: 2, or residue 19 of SEQ ID NO: 4. The described NHP 
sequences can also contain A-G polymorphisms at nucleotide 82 
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of SEQ ID NO:l and nucleotide 7 0 of SEQ ID NO: 3 which can give 
rise to a corresponding ala or thr at amino acid position 28 
of SEQ ID NO: 2, or residue 24 of SEQ ID NO: 4. The described 
NHPs share similarity with trypsin-like proteases, plasminogen 
activators, and human plasma kallikrein precursor. 

5.2 NHPs AND NHP POLYPEPTIDES 
NHPs, polypeptides, peptide fragments, mutated, 
truncated, or deleted forms of the NHPs, and/or NHP fusion 
proteins can be prepared for a variety of uses. These uses 
include, but are not limited to, the generation of antibodies, 
as reagents in diagnostic assays, for the identification of 
other cellular gene products related to a NHP, as reagents, in 
assays for screening for compounds that can be as 
pharmaceutical reagents useful in the therapeutic treatment of 
mental, biological, or medical disorders and disease. 

The Sequence Listing discloses the amino acid sequence 
encoded by the described NHP polynucleotides. The NHPs 
display initiator methionines in DNA sequence contexts 
■ consistent with a translation initiation site, and display a 
consensus signal sequence. 

The NHP amino acid sequences of the invention include the 
amino acid sequences presented in the Sequence Listing as well 
as analogues and derivatives thereof, as well as any 
oligopeptide sequence of at least about 10-40, generally about 
12-35, or about 16-30 amino acids in length first disclosed in 
the Sequence Listing. Further, . corresponding NHP homologues 
from other species are encompassed by the invention. In fact, 
any NHP encoded by the NHP nucleotide sequences described 
above are within the scope of the invention, as are any novel 
polynucleotide sequences encoding all or any novel portion of 
an amino acid sequence presented in the Sequence Listing. The 
degenerate nature of the genetic code is well known, and, 
accordingly, each amino- acid presented in the Sequence 
Listing, is generically representative of the well known 
nucleic acid "triplet" codon, or in many cases codons, that 
can encode the amino acid. As such, as contemplated herein, 
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the amino acid sequences presented in the Sequence Listing, 
when taken together with the genetic code (see, for example, 
Table 4-1 at page 109 of "Molecular Cell Biology", 1986, J. 
Darnell et al . eds., Scientific American Books, New York, NY, 
5 herein incorporated by reference) are generically 
representative of all the various permutations and 
combinations of nucleic acid sequences that can encode such 
amino acid sequences. 

The invention also encompasses proteins that are 

10 functionally equivalent to the NHPs encoded by the presently 
described nucleotide sequences as judged by any of a number of 
criteria, including, but not limited to, the ability to bind 
and cleave a substrate of a NHP, or the ability to effect an 
k identical or complementary downstream pathway, or a change in 

15 cellular metabolism (e.g., proteolytic activity, ion flux, 

tyrosine phosphorylation, etc.). Such functionally equivalent 
NHP proteins include, but are not limited to, additions or 
substitutions of amino acid residues within the amino acid 
sequence encoded by the NHP nucleotide sequences described 

20 above, but which result in a silent change, thus producing a 
functionally equivalent gene product. Amino acid 
substitutions can be made on the basis of similarity m 
polarity, charge, solubility, hydrophobic ity, hydrophilicity, 
and/or the amphipathic nature of the residues involved. For 

25 example, nonpolar (hydrophobic) amino acids include alanine, 
leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, 
and glu.tamine; positively charged (basic) amino acids include 

3 0 arginine, lysine, and histidine; and negatively charged 

(acidic) amino acids include aspartic acid and glutamic acid. 

A variety of host-expression vector systems can be used 
to express the NHP nucleotide sequences of the invention. 
Where, as in the present instance, the NHP products or NHP 

3 5 polypeptides are thought to be soluble or secreted molecules, 
the peptide or polypeptide can be recovered from the culture 
media. Such expression systems also encompass engineered host 
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cells that express a NHP, or a functional equivalent, in situ. 
Purification or enrichment of NHP from such expression systems 
can be accomplished using appropriate detergents and lipid 
micelles and methods well known to those skilled in the art. 
5 However, such engineered host cells themselves may be used in 
situations where it is important not only to retain the 
structural and functional characteristics of the NHP, but to 
assess biological activity, e.g., in drug screening assays. 
The expression systems that may be used for purposes of 

10 the invention include but are not limited to microorganisms 

such as bacteria (e.g., E. coli, B. subtilis) transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing NHP nucleotide sequences; yeast 
1 (e.g. , Saccharomyces , Pichia) transformed with recombinant 

15 yeast expression vectors containing NHP encoding nucleotide 

sequences; insect cell systems infected with recombinant virus 
expression vectors (e.g., baculovirus) containing NHP 
sequences; plant cell systems infected with recombinant virus 
expression vectors (e.g., cauliflower mosaic virus, CaMV; 

20 tobacco mosaic virus, TMV) or transformed with recombinant 
plasmid expression vectors (e.g., Ti plasmid) containing NHP 
nucleotide sequences; or mammalian cell systems (e.g., COS, 
CHO, BHK, 293, 3T3) harboring recombinant expression 
constructs containing promoters derived from the genome of 

25 mammalian cells (e.g., metallothionein promoter) or from 
mammalian viruses (e.g., the adenovirus late promoter; the 
vaccinia virus 7 . 5K promoter) . 

In bacterial systems, a number of expression vectors may 
be advantageously selected depending upon the use intended for 

30 the NHP product being expressed. For example, when a large 
quantity of such a protein is to be produced for the 
generation of pharmaceutical compositions of or containing 
NHP, or for raising antibodies to a NHP, vectors that direct 
the expression of high levels of fusion protein products that 

35 are readily purified may be desirable. Such vectors include, 
but are not limited, to the E. coli expression vector pUR278 
(Ruther et al . , 1983, EMBO J. 2:1791), in which a NHP coding 

- ■ . • ■ . ' . v 
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sequence may be ligated individually inco the vector in frame 
with the lacZ coding region so' that a fusion protein is 
produced; pIN vectors ( Inouye & "Inouye, '1985, Nucleic Acids 
Res. 13:3101-3109; Van Heeke & Schuscer, 1989-, J. Biol. Chem. 
5 254:5503-5509); and the like. pGEX vectors (Pharmacia or 

American Type Culture Collection) can also be used to express 
foreign polypeptides as fusion proteins with glutathione 
S-transferase (GST).. In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by 

10 adsorption to glutathione-agarose beads followed by elution in 
the presence of free glutathione. The PGEX vectors are ■ 
designed to include thrombin or factor Xa protease cleavage 
sites so that the cloned target gene product can be released 
from the GST moiety. 

i5 In an insect system,- Autographa calif ornica nuclear 

polyhidrosis virus (AcNPV) is. used as a vector to express 
foreign genes. The virus grows in ' Spodoptera frugiperda 
cells. A NHP coding sequence can.be cloned individually into 
non-essential regions (for. example the polyhedrin gene) of the 

20 virus' and placed under control of an AcNPV promoter (for 

example the polyhedrin promoter) . Successful insertion of. NHP 
coding sequence will result in inactivation of the polyhedrin 
gene and production of non-occluded recombinant virus (i.e., 
virus lacking the proteinaceous coat coded for by the 

25 .polyhedrin gene). These recombinant viruses are then used to 
infect Spodoptera frugiperda cells in which the inserted 
sequence is expressed (e.g., see Smith et al . , 1983, J. 
Virol. 46: 584; Smith, U.S. Patent No . 4 , 21 5 , 051 ) . 

In mammalian host cells, a number of viral-based 

30 expression systems may be utilized. In cases where an 

adenovirus is used as an expression vector, the NHP nucleotide 
sequence of interest may be ligated to an adenovirus ' 
transcription/ translation control complex, e.g., the late 
promoter and tripartite leader sequence. This chimeric 

35 sequence may then be inserted in the adenovirus genome by in 
vitro or in vivo recombination. Insertion in a non-essential 
region of the viral genome (e.g., region El or E3) will result 
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in a recombinant virus that is viable and capable of 
• expressing a NHP product in infected hosts (e.g., See Logan & 
Shenk, 1984, Proc . Natl. Acad. Sci . USA 51:3655-3659). 
Specific initiation signals may also be required for efficient 
5 translation of inserted NHP nucleotide sequences. These 
signals include the ATG initiation .codon and adjacent 
sequences. In cases where an entire NHP gene or cDNA, 
including its own initiation codon and adjacent sequences, is 
inserted into the appropriate expression vector, no additional 

■'■ 10 translational control signals may be needed. . However, in 
cases where only a portion of a NHP coding sequence is 
inserted, exogenous translational control signals, including, 
perhaps, the ATG initiation codon, must be provided. 
Furthermore,, the initiation codon must be in phase with the 
15 reading frame of the desired coding sequence to ensure 
translation of the entire insert. These exogenous 
translational control signals and initiation codons can be of 
a variety of origins, both natural and synthetic. The 
efficiency of expression may be enhanced by the inclusion. of 

■".'20 appropriate transcription enhancer elements, transcription 
terminators, etc. (See Bittner et al . , 1987, Methods in 
Enzymol. 153:516-544).'. 

In addition, a host cell strain may be chosen that 
modulates the expression of the inserted sequences, or 
.25 modifies and processes the gene product in the specific 

fashion desired. Such modifications (e.g., glycosylation) and 
processing (e.g., cleavage) of protein products may be , 
important for the function of the protein. Different host 
cells have characteristic and specif ic mechanisms - for the . 
30 post-translational processing and modification of proteins and 
gene products. Appropriate cell lines or,. host systems can be 
chosen to ensure the correct modification and processing of 
the foreign protein expressed. To this end, eukaryotic host 
cells which possess the cellular machinery for proper 
35 processing of the primary transcript, glycosylation, and" 
phosphorylation of . the gene product may be used.- Such - 
mammalian host cells include, but are not limited to, CHO, 
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VERO, BHK, HeLa, COS, MDCK, 2S3, 3T3, WI38, and in particular, - 
human cell lines. 

For long-term, high-yield production of recombinant 
proteins, stable expression is preferred. For example, cell 
5 lines which stably express the NHP sequences described above 
can be engineered. Rather than using expression vectors which 
contain viral origins of replication, host cells can be 
transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer sequences, 

10 transcription terminators, polyadenylation sites, etc.), and a 
selectable marker. ' Following the introduction of the foreign 
DNA, engineered cells may be allowed to grow for 1-2 days in 
an enriched media, and then are switched to a selective media. 
The selectable marker in the recombinant plasmid confers 

15 resistance to the selection and allows cells to stably 

integrate the plasmid into their chromosomes and grow to form 
foci which in turn can be cloned and expanded into cell lines. 
This method may advantageously be used to engineer cell lines 
which express the NHP product. Such engineered cell lines may 

20 be particularly useful in screening and evaluation of 

compounds that affect the endogenous activity of the NHP 
product . 

A number of selection systems may be used, including but 
not limited to the herpes simplex virus thymidine kinase 
25 (Wigler, et al . , 1911, Cell 21:223), hypoxanthine- guanine 

phosphdribosyltransferase (Szybalska & Szybalski, 1962, Proc. 
Natl. Acad. Sci. USA 48:2026), and adenine 

phosphoribosyltransferase (Lowy, et al., 1980, Cell 22:817) 
genes can be employed in tk~, hgprt" or aprt" cells, 

30 respectively. Also, antimetabolite resistance can be used as - 
the basis of selection for the following genes: dhfr, which 
confers resistance to methotrexate (Wigler, et al., 1980, 
Natl. Acad. Sci. USA 77:3567; O'Hare, et al . , 1981, Proc. 
Natl. Acad. Sci . USA 75:1527); gpt, which confers resistance 

35 to mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad. 
Sci. USA 78:2072); neo,. which confers resistance to the 
aminoglycoside G-418 (Colberre-Garapin, et al . , 1981, J. Mol. 



20 



WO 01/46407 PCT/US00/33738 

Biol. 150:1); and hygro, which confers resistance to 
hygromycin (Santerre, et al . , 1984, Gene 30:147). 

Alternatively, any fusion protein can be readily purified 
by utilizing an antibody specific for the fusion protein being 
5 expressed. For example, a system described by Janknecht et 
al. allows for the ready purification of non-denatured fusion 
proteins expressed in human cell lines (Janknecht, et al . , ' 
1991, Proc. Natl. Acad. Sci. USA 83:8972-8976). In this 
system, the sequence of interest . is subcloned into a vaccinia 

10 recombination plasmid such that the gene's open reading frame 
is translationally. fused to an amino -terminal tag consisting 
of six histidine residues. Extracts from cells infected with 
recombinant vaccinia virus are loaded onto Ni 2+ -nitriloacetic 
acid-agarose columns and histidine-tagged proteins are 

15 selectively eluted with imidazole-containing buffers. 

Also encompassed by the present invention are novel protein 
constructs engineered in such a way that they facilitate 
transport of the NHP to the target site,' to the desired organ, 
across the cell membrane and/or to the nucleus where the NHP 

20 can exert its function activity. This goal may be achieved by 
coupling of the NHP to a cytokine or other ligand that would 
direct the NHP to the target organ and facilitate receptor 
mediated transport across the membrane into the cytosol . 
Conjugation of NHPs to antibody molecules or their Fab 

25 fragments could' be used to target cells bearing a particular 
epitope? Attaching the appropriate signal sequence to the NHP 
would also transport the NHP to the desired location within 
the cell. Alternatively targeting of NHP or its nucleic acid 
sequence might be achieved using liposome or lipid complex 

30 based delivery systems . Such technologies are described in 

Liposomes :A P^rt-ira-l- Abroach , New RRC ed. , Oxford University 
Press, New York and in U.S. Patents Nos . 4,594,59 5, 5,459,127, 
5,948,767 and 6, 110,490 and their 5 respective disclosures which 
are herein incorporated by reference in their entirety. 
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5.3 ANTIBODIES TO NHP PRODUCTS 
Antibodies that specifically recognize one or more 
epitopes of a NHP, or epitopes of conserved variants of a NHP, 
or peptide fragments of a NHP are also encompassed by the 
invention. Such antibodies include but are not limited to 
polyclonal antibodies, monoclonal antibodies (mAbs) , humanized 
or chimeric antibodies, single chain antibodies, Fab 
fragments, F(ab') 2 fragments, fragments produced by a Fab 
expression library, anti-idiotypic (anti-Id) antibodies, and ' 
epitope-binding fragments of any of the above. 

The antibodies of the invention may be used, for example, 
in the detection of NHP in a biological sample and may, 
therefore, be utilized as part of a diagnostic or prognostic 
technique whereby patients may be tested for abnormal amounts 
of NHP. Such antibodies may also be utilized in conjunction 
with, for example, compound screening schemes for the 
evaluation of the effect of test compounds on expression 
and/or activity of a NHP gene product. Additionally, such 
antibodies can be used in conjunction gene therapy to, for 
example, evaluate the normal and/or engineered NHP- expressing 
cells prior to their introduction into the patient. Such 
antibodies may additionally be used as a method for the 
inhibition of abnormal NHP activity. Thus, such antibodies 
may, therefore, be utilized as part of treatment methods. 

For the production of antibodies, various host animals 
may be immunized by injection with the NHP, an NHP peptide 
(e.g., one corresponding the a functional domain of an NHP), 
truncated NHP polypeptides (NHP in which one or more domains 
have been deleted) , functional equivalents of the NHP or 
mutated- variant of the NHP. Such host animals may include but 
are not limited to pigs, rabbits, mice, goats, and rats, to 
name but a few.. Various adjuvants may be used to increase the 
immunological response, depending on the host species, - 
including but not limited to Freund's adjuvant (complete and 
incomplete) , mineral salts such as aluminum hydroxide or 
aluminum phosphate, surface active substances such as 
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lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, and potentially useful human adjuvants such as BCG 
(bacille Calmette-Guerin) and Corynebacterium parvum . 
Alternatively, the immune response could be enhanced by 
5 combination and or coupling with molecules such as keyhole 
limpet hemocyanin, tetanus toxoid, diptheria toxoid, 
ovalbumin, cholera toxin or fragments thereof. Polyclonal 
antibodies are heterogeneous populations of antibody molecules 
derived from the sera of the immunized animals. 

10 Monoclonal antibodies, which are homogeneous populations 

of antibodies to a particular antigen, can be obtained by any 
technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These include, 
but are not limited to, the hybridoma technique of Kohler and 

15 Milstein, (1975, Nature 255:495-497; and U.S. Patent No. 

4,376,110), the human B-cell hybridoma technique (Kosbor et 
al., 1983, Immunology Today 4:72; Cole et al . , 1983, Proc . 
Natl. Acad. Sci. USA 80:2026-2030), and the EBV-hybridoma 
technique (Cole et al . , 1985, Monoclonal Antibodies And Cancer 

20 Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may 
be of any immunoglobulin class including IgG, IgM, IgE, IgA, . 
IgD and any subclass thereof. The hybridoma producing the mAb 
of this invention may be cultivated in vitro or in vivo. 
Production of high titers of mAbs in vivo makes this the 

25 presently preferred method of production. 

In addition, techniques developed for the production of 
"chimeric antibodies" (Morrison et al . , 1984, Proc. Natl. 
Acad. Sci., 81:6851-6855; Neuberger et al, , 1984, Nature, 
312:604-608; Takeda et al. . 1985, Nature, 314:452-454) by 

30 splicing the genes from a mouse antibody molecule of 

appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity can 
be used. A chimeric antibody is a molecule in which different 
portions are derived from different animal species, such as 

35 those having a variable region derived from a murine mAb and a 
human immunoglobulin constant region. Such technologies are 
described in U.S. Patents Nos. 6,075,181 and 5,877,397 and 
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their respective disclosures which are herein incorporated by 
reference in their entirety. . 

Alternatively, techniques described for the production of 
single chain antibodies (U.S. Patent 4,946,778; Bird, 1988, 
5 Science 242:423-426 ; Huston et al . , 1988, Proc . Natl . Acad. 
Sci. USA 85:5879-5883 ; and Ward et al . , 1989, Nature 334:544- 
546) can be adapted to produce single chain antibodies against 
NHP gene products. Single chain antibodies are formed by 
linking the heavy and light chain fragments of the Fv region 
10 via an amino acid bridge, resulting in a single chain 
polypeptide. 

•Antibody fragments which recognize specific epitopes may 
be generated by known techniques. For example, such fragments 
include, but are not limited to : the F (ab' ) 2 fragments which 

15 can be produced by pepsin digestion of the antibody molecule 
and the Fab fragments which can be generated by reducing the 
disulfide bridges of the F(ab') 2 fragments. Alternatively, Fab 
expression libraries may be constructed (Huse et al . , 1989, 
Science, 246:1275-1281) to allow rapid and easy identification 

20 of monoclonal Fab fragments with the desired specificity. 

Antibodies to a NHP can, in- turn, be utilized to generate 
anti-idiotype antibodies that "mimic" a given NHP, using 
techniques well known to those skilled in the art. (See, 
e.g., Greenspan & Bona, 1993, FASEB J 7 (5) : 437-444 ; and 

25 Nissinoff, 1991, J. Immunol. 147 (8) : 2429-2438 ) . For example 
antibodies which bind to a NHP domain and competitively 
inhibit the binding' of NHP to its cognate receptor can be used 
to generate anti-idio types that "mimic" the NHP and, 
therefore, bind and activate or neutralize a receptor. Such 

30 anti-idiotypic antibodies or Fab fragments of such anti- 
idiotypes can be used in therapeutic regimens involving a NHP 
signaling pathway. 

The- present invention is not to be limited in scope by 
the specific embodiments described herein, which are intended 

35 as single illustrations of individual aspects of the 

invention, and functionally equivalent methods and components 
are within .the scope of the invention. Indeed, various 
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modifications of the invention, in addition to those shown and 
described herein will become apparent to those skilled in the 
art from the foregoing description. Such modifications are 
intended to fall within the scope of the appended ' claims . All 
cited publications, patents, and patent applications are 
herein incorporated by reference in their entirety. 
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1. An isolated nucleic acid molecule comprising at 
least 24 contiguous bases of nucleotide sequence first 

5 disclosed in the NHP polynucleotide described in. SEQ ID NO: 1. 

2 . An isolated nucleic acid molecule comprising a 
nucleotide sequence that:. - 

(a) encodes the amino acid sequence shown in SEQ ID 
10 NO: 2; and 

(b) hybridizes under stringent conditions to the 
nucleotide sequence of SEQ ID NO: 1 or the 
complement thereof. 

15 ,3..- An- isolated nucleic acid molecule encoding the 

amino acid sequence described in SEQ ID NO: 2. 
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SEQUENCE LISTING 

<110> Walke, D. Wade , 

Turner, C. Alexander Jr. 
Abuin, Alejandro 
Friedrich, Glenn 
Zambrowicz , Brian 
Sands, Arthur T. 

<120> Novel Human Proteases and 
Polynucleotides Encoding the Same 

<130> LEX-0108-PCT 

<150> US 60/171,566 
<151> 1999-12-22 

<160> 7 



<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 921 

<212> DNA 

<213> Homo sapiens 



<400> 1 

atgagtctca 

tttgaacrag 

ctggttaagg 

caagtggaga 

tgtggaggaa 

agaaacattg 

ccaggagagc 

aaaccaatgg 

tttgtggggc 

acaactgcag 

gtgaatctgc 

cccatcagtg 

cagggagatt 

ggtgtgactt 

gatcaaggat 

cacatccaaa 



aaatgcttat 
gtaaatctgc 
tacagccttg 
agggttccta 
gcatcgtctc 
tgtctacttt 
aaactctcac 
actatgatat 
ccatatgtct 
gctggggccg 
ctattttgac 
ggaagacctt 
caggaggttc 
cctggggttt 
cccctgggat 
ctggtaacta 



aagcaggaac 
arctctttcg 
gaattatttt 
tccctggcag 
accacagtgg 
gaatgttact 
tattgaaact 
tgcccttttg 
tccagagctg 
cttaactgaa 
ctgggaagag 
tctttgcaca 
actcatgtgc 
gggctgtggt 
cttcacagac 
a 



aagctgattt 
ctccccaaag 
aacattttca 
gtatctctga 
gtgatcacgg 
gctggagagt 
gtcatcatac 
aagatggctg 
cgggagcaat 
ggtggcgtcc 
tgtgtggcag 
ggttttcctg 
cggaataaga 
cgaggctgga 
attagtaaag 



tactactagg 


aatagtcttt 


60 


ctcccagttg 


tgggcagagt 


120 


gtcgcattct 


tggaggaagc 


180 


aacaaaggca 


gaagcatatt' 


240 


cggctcactg 


cattgcaaac 


300 


atgacttaag 


ccagacagac 


360 


atccacattt 


ctccaccaag . 


420 


gagccttcca 


atttggccac 


480 


ttgaggctgg 


ttttatttgt 


540 


tctcacaagt 


cttgcaggaa 


600 


ctctgttaac 


actaaagagg 


660 


atggagggag 


agacgcatgt 


720 


aaggggcctg 


gactctggct 


780 


gaaacaatgt 


gaggaaaagt 


840 


tgctttcctg 


gatccacgaa 


900 
921 



<210> 2 

<211> 306 

<212> PRT 

<213> Homo sapiens 



<400> 2 


























Met Sef 


Leu Lys 


Met 


Leu 


He 


Ser 


Arg Asn 


Lys 


Leu 


He 


Leu 


Leu 


Leu 


1 


5 








10 










15 




Gly He 


Val. Phe 


Phe' 


Glu 


Arg 


Gly 


Lys Ser 


Ala 


Ala. 


Leu 


Ser 


Leu 


Pro 


20 










25 








30 






Lys Ala 


Pro Ser 


Cys 


Gly 


Gin 


Ser 


Leu Val 


Lys 


Val 


Gin 


Pro 


Trp 


Asn 




35 








.40. ' 








45 








Tyr Phe 


Asn He 


Phe 


Ser 


Arg 


lie 


Leu Gly 


Gly 


Ser 


Gin 


Val 


Glu 


Lys 


50 








55 








60 








lie 


Gly Ser 


Tyr Pro 


Trp 


Gin 


Val 


Ser 


Leu Lys 


Gin 


Arg 


Gin 


Lys 


His 


65 






70 








75 










80 


Cys Gly 


Gly Ser 


He 


Val 


Ser 


Pro 


Gin Trp 


Val 


He 


Thr 


Ala 


Ala 


His 


85 








90 










.95 
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<210> 3 

<211> 909 

<212> DNA 

<213> Homo sapiens 



<400> 3 

atgcttataa 

aaatctgear 

cagccttgga 

ggttcctatc 

atcgtctcac 

tctactttga 

actctcacta 

tatgatattg 

atatgtcttc 

tggggecget 

attttgacct 

aagaccttcc 

ggaggttcac 

tggggtttgg 

cctgggatct 

ggtaactaa 



gcaggaacaa 
ctctttcgct 
attattttaa 
cctggcaggt 
cacagtgggt 
atgttactgc 
ttgaaactgt 
cccttttgaa 
cagagctgeg 
taactgaagg 
gggaagagtg 
tttgeacagg 
tcatgtgccg 
gctgtggtcg 
tcacagacat 



gctgatttta 
ccccaaagct 
cattttcagt 
atctctgaaa 
gatcaeggeg 
tggagagtat 
catcatacat 
gatggctgga 
ggagcaattt 
tggcgtcctc 
tgeggcaget 
ttttcctgat 
gaataagaaa 
aggctggaga 
tagtaaagtg 



ctactaggaa 
cccagttgtg 
cgcattcttg 
caaaggcaga 
gctcactgca 
gaettaagee 
ccacatttct 
gccttccaat 
gaggctggtt 
teacaagt'et 
ctgttaacac 
ggagggagag 
ggggcctgga 
aacaatgtga 
ctttcctgga 



tagtcttttt 
ggcagagtct 
gaggaageca 
agcatatttg 
ttgcaaacag 
agacagaccc 
ccaccaagaa 
ttggccactt 
ttatttgtac 
cgcaggaagt 
taaagaggee 
aegcatgtea 
ctctggctgg 
ggaaaagtga 
tccacgaaca 



tgaacraggt 
ggttaaggta 
agtggagaag 
tggaggaagc 
aaacattgcg 
aggagagcaa 
accaatggac 
tgtggggccc 
aactgeagge 
gaatctgect 
catcageggg 
gggagattca 
tgtgacttcc 
tcaaggatcc 
catccaaact 



<210> 4 
<211> 302 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Leu lie Ser Arg Asn Lys Leu lie Leu Leu Leu Gly He Val Phe 

15 10 15 

Phe Glu Arg Gly Lys Ser Ala Ala Leu Ser Leu Pro Lys Ala Pro Ser 

20 25 .30 

Cys Gly Gin Ser Leu Val Lys Val Gin Pro Trp Asn Tyr Phe Asn He 
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<210> 5 
<211> 495 
<212> EKA 

<213> Homo sapiens , 
<400> 5 '' ;•' . ' 

atggactatg atattgccct tttgaagatg' gctggagcct tccaatttgg ccactttgtg 60 
gggcccatat gtcttccaga gctgcgggag caatttgagg ctggttttat ttgtacaact .120 
gcaggctggg gccgcttaac tgaaggtggc gtcctctcac aagtcttgca ggaagtgaat 180 
ctgcctattt tgacctggga agagtgtgtg gcagctctgt taacactaaa gaggcccatc 240 
agtgggaaga cctttctttg cacaggtttt cctgatggag ggagagacgc atgtcaggga 300 
gattcaggag gttcactcat gtgccggaat aagaaagggg cctggactct ggctggtgtg 3.60 
acttcctggg gtttgggctg tggtcgaggc tggagaaaca atgtgaggaa aagtgatcaa 420 
ggatcccctg ggatcttcac agacattagt aaagtgcttt cctggatcca cgaacacatc 480 
caaactggta actaa 495 
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<210> 7 

<211> 1568 

<212> DNA 

<213> Homo sapiens 



<400> 7 

catacaccat 

ctagaagcta 

ttccatttca 

ggactggctc 

aactatgagt 

cttttttgaa 

gagtctggtt 

aagccaagtg 

tatttgtgga 

aaacagaaac 

agacccagga 

caagaaacca 

ccactttgtg 

ttgtacaact 

ggaagtgaat 

gaggcccatc 

atgtcaggga 

ggctggtgtg 

aagtgatcaa 

cgaacacatc 

agggccagag 

tacctctgtt 

agagtaaaat 

aatgaaaaat 

tcaattgatg 

attacaccga 

agttatca 



agtctcagac 
tctaatcatt 
ggtgtcgtga 
aaaggcttcc 
ctcaaaatgc 
craggtaaat 
aaggtacagc 
gagaagggtt 
ggaagcatcg 
attgtgtcta 
gagcaaactc 
atggactatg 
gggcccatat 
gcaggctggg 
ctgcccattt 
agtgggaaga 
gattcaggag 
acttcctggg 
ggatcccctg 
caaactggta 
acagcatcag 
ctcagtgtag 
atggtaaaaa 
gaatccatcc 
atatgctttc 
tggcaccact 



tcagtttcat 
tagtttgtct 
aaagctcgaa 
tatttttgtt 
ttataagcag 
ctgcarctct 
cttggaatta 
cctatccctg 
tctcaccaca 
ctttgaatgt 
tcactattga 
atattgccct 
gtcttccaga 
gccgcttaac 
tgacctggga 
cctttctttg 
gttcactcat 
gtttgggctg 
ggatcttcac 
actaagccat 
cagagtcctg 
cacacaagga 
aaaaaaaaaa 
atcctagacc 
accacttact 
tacttctcaa 



gggtgaaatg 
cgtttttttc 
ttcggcgcgc. 
tgctttagtc 
gaacaagctg 
t'tcgctcccc 
ttttaacatt 
gcaggtatct 
gtgggtgatc 
tactgctgga 
aactgtcatc 
tttgaagatg 
gctgcgggag 
tgaaggtggc 
agagtgtgtg 
cacaggtctt 
gtgccggaat 
tggtcgaggc 
agacattagt 
cacacaaggt 
gcaaatcaga 
ttgtgaggtt 
aaaaaaatgt 
ttctcaaact 
cctctggatt 
aaaaatccag 



gagaagatat 
tctggttcaa 
cagatatcac 
tctctaaaat 
attttactac 
aaagctccca 
ttcagtcgca 
ctgaaacaaa 
acggcggctc 
gagtatgact 
atacatccac 
gctggagcct 
caatttgagg 
gtcctctcac 
gcagctctgt 
cctgatggag 
aagaaagggg 
tggagaaaca 
aaagtgcttt 
taagaagctg 
gcacctgaac 
taccaagtct 
ggagcagcat 
ggcctttaat 
cagagtccag 
caaactataa 



tacctcaatc 

agtttttttc 

acgtgccaag 

ttcagggaaa 

taggaatagc 

gttgtgggca 

ttcttggagg 

ggcagaagca 

actgcattgc 

taagccagac 

atttctccac 

tccaatttgg 

ctggttttat 

aagtcttgca 

taacactaaa 

ggagagacgc 

cctggactct 

atgtgaggaa 

cctggatcca 

ccattctgct 

caacagtctc 

aaataaaaca • 

gcagacattc 

tgaaactatc 

agtgctcacc 

ccagatcagt 
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